High-Dimensional Bayesian Clustering with Variable Selection: TheRPackagebclust
نویسندگان
چکیده
منابع مشابه
Bayesian Variable Selection in Clustering High-Dimensional Data With Substructure
In this article we focus on clustering techniques recently proposed for highdimensional data that incorporate variable selection and extend them to the modeling of data with a known substructure, such as the structure imposed by an experimental design. Our method essentially approximates the within-group covariance by facilitating clustering without disrupting the groups defined by the experime...
متن کاملBayesian Variable Selection in Clustering High-Dimensional Data
Over the last decade, technological advances have generated an explosion of data with substantially smaller sample size relative to the number of covariates (p n). A common goal in the analysis of such data involves uncovering the group structure of the observations and identifying the discriminating variables. In this article we propose a methodology for addressing these problems simultaneousl...
متن کاملHigh-Dimensional Bayesian Clustering with Variable Selection: The R Package bclust
The R package bclust is useful for clustering high-dimensional continuous data. The package uses a parametric spike-and-slab Bayesian model to downweight the effect of noise variables and to quantify the importance of each variable in agglomerative clustering. We take advantage of the existence of closed-form marginal distributions to estimate the model hyper-parameters using empirical Bayes, t...
متن کاملOn the Computational Complexity of High-Dimensional Bayesian Variable Selection
We study the computational complexity of Markov chain Monte Carlo (MCMC) methods for high-dimensional Bayesian linear regression under sparsity constraints. We first show that a Bayesian approach can achieve variable-selection consistency under relatively mild conditions on the design matrix. We then demonstrate that the statistical criterion of posterior concentration need not imply the comput...
متن کاملConsistent high-dimensional Bayesian variable selection via penalized credible regions.
For high-dimensional data, particularly when the number of predictors greatly exceeds the sample size, selection of relevant predictors for regression is a challenging problem. Methods such as sure screening, forward selection, or penalized regressions are commonly used. Bayesian variable selection methods place prior distributions on the parameters along with a prior over model space, or equiv...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Statistical Software
سال: 2012
ISSN: 1548-7660
DOI: 10.18637/jss.v047.i05